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STOCHASTIC GROWTH RATES FOR LIFE HISTORIES WITH 
RARE MIGRATION OR DIAPAUSE 


DAVID STEINSALTZ AND SHRIPAD TULJAPURKAR 


Abstract. The growth of a population divided among spatial sites, with mi¬ 
gration between the sites, is sometimes modelled by a product of random 
matrices, with each diagonal elements representing the growth rate in a given 
time period, and off-diagonal elements the migration rate. If the sites are 
reinterpreted as age classes, the same model may apply to a single population 
with age-dependent mortality and reproduction. 

We consider the case where the off-diagonal elements are small, representing 
a situation where there is little migration or, alternatively, where a determin¬ 
istic life-history has been slightly disrupted, for example by introducing a rare 
delay in development. We examine the asymptotic behaviour of the long-term 
growth rate. We show that when the highest growth rate is attained at two 
different sites in the absence of migration (which is always the case when mod¬ 
elling a single age-structured population) the increase in stochastic growth rate 
due to a migration rate e is like (loge“^)“^ as e 4.. 0, under fairly generic con¬ 
ditions. When there is a single site with the highest growth rate the behavior 
is more delicate, depending on the tails of the growth rates. For the case when 
the log growth rates have Gaussian-like tails we show that the behavior near 
zero is like a power of e, and derive upper and lower bounds for the power in 
terms of the difference in the growth rates and the distance between the sites. 


1. Introduction 

1.1. Biological motivation. If a population is divided among spatial sites, with 
distinct fixed growth rates at each site, with no migration between sites, the num¬ 
bers in the best site will become overwhelmingly larger than those at the other sites, 
and the overall population growth rate will be determined by the rate prevailing at 
the best site. Introducing migration between sites, as Karlin showed [Kar82], will 
always reduce the long-run growth rate of the total population. 

Karlin’s theorem assumes deterministic growth, den Boer [dB68] argued that 
migration may increase long-run growth when there is independent or weakly cor¬ 
related stochastic variation in growth among sites. In a different context, it has 
long been argued [Col54] that populations of individuals who delay or spread repro¬ 
duction over time will suffer reduced growth rate. But Cohen [Coh66] and Cohen 
and Levin [CL91] used analysis and simulations to show that long-run growth of 
a population could increase as a result of a life cycle delay when there are some 
kinds of random variation in time, or by migration when there are some kinds 
of random variation across space. These kinds of stochastic variation have been 
formulated as random matrix models whose Lyapunov exponent is the long-run 
growth rate of the population, as discussed by [TWOO, WT94]. In this general 
setting, we would like to know whether the long-run growth rate increases when 
there is mixing in space and/or time [TWOO] — biologically, when should migration 
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and/or delay be favoured to evolve? A general and precise answer has been difficult 
because previous work [WT94] shows that the long-run growth rate can be singular 
(e.g., non-differentiable) in the limit of no mixing. A similar singularity arises in 
random-matrix models used in models of disordered matter ([DH83]). 

Here we consider a random-matrix model of migration among sites whose indi¬ 
vidual growth rates vary stochastically over time, and characterize the behavior of 
the Lyapunov exponent in the limit of zero migration. As we show, this model can 
be used to study a number of models of migration, life cycle delay, or a combination 
of these. Our results address evolutionary stability (in a fitness-maximising con¬ 
text) of a small amount of mixing, via migration or life-cycle delays. We find that 
when the growth rates at different sites are similar, the population with a small 
positive migration rate will profit from the variability between sites, to grow faster 
than would have been possible even at the best single site. In particular, when the 
growth rates are equal — which is always the case in the diapause setting — the 
sensitivity of stochastic growth rate to changes migration rate is extreme, varying 
near 0 like l/loge“^. 

Our results complement a recent analysis of spatially stochastic growth in [ERSS12]. 
They use diffusions to model migration among sites that have independently vary¬ 
ing stochastic growth rates and characterize the migration rate that maximizes the 
long-run stochastic growth rate. 

1.2. The mathematical problem: Migration. Suppose Di,D 2 ,... is an i.i.d. 

sequence of d x d diagonal matrices. We write for the diagonal 

elements of Dt, and assume := log^l*^ all have finite mean fXi and finite 

variance. We order them so that is the largest. We also write := /xq — Hi- 

We let At be an i.i.d. sequence of nonnegative d x d matrices with zeros on the 
diagonal, and we assume that the pairs (Dt, At) are all independent. We define the 
migration graph M to be a directed graph whose vertices are the sites {0,..., d— 1}, 
possessing an edge {i,j) if essinf E[log At(j, f) | Dt] > —oo. That is, we do not 
assume that At and Dt are independent, but for the migration steps we consider to 
be possible we assume that there is a lower bound to how close At{j,i) can come 
to 0 that is independent of all Dt- It follows trivially that when (z,j) is an edge 
E[logAi(j, f) I D] > —oo where CD is the cr-algebra generated by {Dt, 1 <t < oo}. 
We assume that M is connected. 

We let A be a fixed diagonal matrix with entries Aq, ..., A^-i. (Generally we 
will be thinking of A as the growth or survival penalty for migration or diapause, 
so that the entries will be negative, but this is not essential.) We assume the 
penalty acts multiplicatively on growth — this seems reasonable from a modeling 
perspective, and elegantly avoids the problem of negative matrix entries — and is 
proportional to e. We define 

Dt{e) := Dt + eAt- 
We also write A*^®) := Aq — A^. 

For e > 0 the i.i.d. sequence Dt{e) satisfies the conditions for the existence of 
a stochastic growth rate independent of starting condition. [Coh79] That is, if we 
define the partial products 

'■= Dxie) ■ Dt-i{() .-Di(e) 
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then 

a(e) := lim T'Mog 

are well defined deterministic quantities, in the sense that the limit exists almost 
surely, is almost-surely constant, and is the same for any 0 < i,j < d — 1. 

Of course, Rt{0) is not so simple. The off-diagonal terms are all 0, while on the 
diagonal, by the Strong Law of Large Numbers, 

lim logi?T(0)i,j = Ilf 
T —¥00 

We take a(0) := /tq = lim£^oa(e)- (That is, the growth rate of all matrix entries 
when e > 0 is small will be dominated by the largest growth rate in a single diagonal 
entry. This will be an elementary consequence of the stronger results that will be 
stated in section 1.7, and proved in the sequel.) 


1.3. Variation of the mathematical problem: Diapause. Consider a pop¬ 
ulation in which individuals progress through immature life stages until reaching 
adulthood, when they reproduce and then die. Diapause is a life-cycle delay in 
which individuals can stay in some immature stage with some probability. We can 
describe diapause by reconceptualizing the “sites” of the previous section as life 
stages, and also describe an organism’s progress using matrices that are not diago¬ 
nal, but sub-diagonal. The life stages (or sites) are viewed as a cycle, described by 
matrices of the form 


Mt 


/ 0 0 
0 

0 


0 Bt\ 
0 0 

0 0 


V 0 0 • • • 0 / 


Here ages run from 1 to d, and are equivalently referred to as age classes that run 
from 0 to d — 1. The quantity € (0,1) is the proportion surviving from age 
j to j -|- 1 in year t, and Bt is the average number of offspring produced when an 
individual becomes mature in age-class d — 1. Offspring are born into age-class 0, 
and the parent — in age class d — 1 — dies.To this we add eA, where now H is a 
fixed diagonal matrix with nonnegative entries, and at least one positive entry, and 
also allow for penalties 
We immediately have 


.. d-2 

(1) a(0) = - (E[log Bt] + ^ E[log . 

3=0 

If we look at this in groups of d generations, the product 

Dt :=( e'’'^ Mdt+d-i + e^dt+d-i) ( Mdt+d-2 + iAdt+d- 2 )''' { ^dt + eAdt) 

is diagonal when e = 0, and is of the form described in section 1.2. 

Consequently, we may apply Theorem 3 to this Dt, producing the same 1 / log 
rate of increase, as stated in Corollary 4. 
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1.4. The effect of the penalty A. We will mostly be concerned with analyzing 
the case A = 0. For most purposes, A has no effect. But this is not always true. 

The crucial point is that the effect of A is always nearly linear in e, while the 
increase of a near 0 is often super linear, growing either as l/loge“^ or as for 
a power /3 < 1. In either case, the rapid increase in a near 0 will be qualitatively 
unaffected by a linear term for e sufficiently small. Even when the linear term is 
negative (as we will generally be assuming it to be), the growth rate a will still be 
increasing on a small interval of e > 0. 

On the other hand, as discussed in section 1.7 in some cases we cannot exclude 
the possibility that the growth rate when A = 0 is qualitatively like with /3 > 1. 
If /3 > 1 and Aq < 0 then a will be decreasing near 0; if /3 = 1 then a more sensitive 
analysis would be required. 

Since the upper and lower bounds on the appropriate power of e in Theorem 5 are 
distinct, with the lower bound on the growth rate (the upper bound on the power 
of e) being sometimes larger than 1, it will not always be possible to determine 
whether the change in growth rate increases or decreases for infinitesimal changes 
in e. 


1.5. Orlicz norm and sub-Gaussian variance factor. Let (x) = e®^/5. Fol¬ 
lowing [Pol90] we define the Orlicz norm ||.^||>p for a centered random variable Z 
by 


(2) ||Z||^:=inf{G : E[vI/(|Z|/G)] < 1}. 


In [BLM13] a random variable Z is said to be sub-Gaussian if it has finite variance 
factor t{Z), defined as 

(3) t{Z) := inf {a > 0 : E [e^^] < e“^'/2 VA G m} . 

(The square-root of this is called the sub-Gaussian standard in [BKOO].) We also 
introduce the term subvariance to denote 


(4) 


T^{Z) := liminf 
2—>-00 


-21ogP{Z > z} ’ 


Note that the variance factor may be thought of as an upper bound on the scale of 
the tails, while the subvariance is a lower bound. When Z is not centered (that is, 
E[Z] 7 ^ 0) we apply the definition to Z — E[Z]. 

We point out here that the assumption that Xt = log(5(^VCt°^) bave nonzero 
subvariance implies what may be considered exceptionally heavy tails for the growth 
rates — effectively, something like log-normal. This is what is required for a nontriv¬ 
ial lower bound in Theorem 5. Thus, it seems plausible to infer that the population 
will obtain no long-term benefit from sending occasional individuals to a site with 
lower average growth, unless the low average growth is compensated by fat positive 
tails, meaning that there is a small chance of a very large payoff. (These heavy 
tails may also be generated if puts too much probability near 0 — that is, a 
population crash.) 


Lemma 1. A sub-Gaussian centered random variable Z satisfies 


(5) 

m ,<^: 

(6) 

T*{Z) < t(Z) < oo 
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If Z is Gaussian with mean 0 and variance then 

(7) T*(Z)=r(Z) = a2. 

Proof. The statement (7) is trivial. 

If T = t{Z) is finite then for any X,z,S > 0, 

P{|Z| >z} 

Taking A = z/{t + 6), we have P{|2'| > z} < e~^ which implies 

(8) ¥{\Z\> z} 

since 6 is arbitrary. This immediately proves (6). 

Integrating by parts, we have for C > '/2t, 



0 /*C)0 


II c = 



then this bound is 5, proving (5). 


□ 


The Orlicz norm is a norm, in the sense that the Orlicz norm of an arbitrary 
sum of random variables is no greater than the sum of the Orlicz norms. For 
independent sub-Gaussian random variables Xi,..., X}. the variance factors are 
also sub-additive. 


Lemma 2. For any independent centered sub-Gaussian random variables Xi,..., Xu, 

(9) <^r(Xi), 

and 

(10) > a:| < exp| — 

Also 

1/2 

( 11 ) \\Xi + ■ ■ ■ + XkU < i^T{Xi)) . 

//maxr(Xi) < r then 

( 12 ) 

and 


(13) 


||Xi -l- • • • -l- Xfe||,p < 



Proof. Statement (9) is Lemma 1.7 of [BKOO], and (10) follows by (8). The remain¬ 
der follows by Lemma 1. □ 
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1.6. Some basic assumptions. For the upper bounds on growth rate (see Theo¬ 
rem 5) we will be assuming 

v(i)_v(l) _ v(o) 

with mean —are sub-Gaussian, writing := -I- We will also 

denote the variance factor of itself by We assume all of these are finite. 
We also define the corresponding subvariances and Thus 

A2f0-) 

< - . 

- 2 

We write 


(14) 


logE 




(j)l 


fj,A ■■= max E[log+ A(j, i) - 

■= Mhj) - 

0<t,j<d—l 


We use log_|_ At(i, j) := 0 V log At{i,j), because the negative part may actually be 
infinite. We will use these expressions only for defining an upper bound on the 
growth increase induced in a migration step by the matrix elements for 

which increasing At(i,j) arbitrarily will only improve the bound. 


1.7. Main results. 


1.7.1. Same average growth rate at two distinct sites. 

Theorem 3. The modulus of continuity of a at e = 0 is bounded by 1/loge“^. 
That is, 

(15) hmsup(loge“^)(a(e) — a(0)) < oo. 

If there exists a site j such that min/i^-l) = 0 and is not almost surely zero, 
then a has modulus of continuity l/loge“^ ot e = 0. That is, 

(16) 0 < liminf(loge“^)(a(e) — a(0)) < limsup(loge“^)(a(e) — a(0)) < oo. 

GO V I eio A / 

Notice that this is a fairly generic result, as the lower bound does not depend 
on any assumptions about the tails. The upper bound does depend on the sub- 
Gaussian assumption for the logarithms of the matrix entries, meaning that heavy¬ 
tailed distributions — including, but not exclusively, those that are sub-exponential 
[Teu75], so a fortiori entries with polynomial order tail behaviour — could have an 
even slower convergence to 0 as e approaches 0. 

1.7.2. Rare diapause. 

Corollary 4. In the diapause setting with Mt being not deterministic — that is, 
at least one entry has nonzero varianee — a has modulus of eontinuity l/loge“^ 
at e = 0. That is, 

(17) 0 < liminf(loge“^)(a(e) — a(0)) < limsup(loge“^)(a(e) — a(0)) < oo. 

GO V I eio A / 
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1.7.3. Distinct growth rates. For the rest of this section we assume that only site 
0 has the maximum growth rate. Then a still increases for small values of e > 0 
when Aq = 0, but the growth is only like a power of e. Whether this remains true 
for Aq < 0 will depend on the exact power, since the change on the diagonal may 
produce a decrement linear in e, so that a(e) will only be increasing in e near 0 if 
the growth in a(e) is at a rate faster than linear — that is, if the power is smaller 
than 1. 

We do not know exactly which power it is, but Theorem 5 gives upper and lower 
bounds. The bounds are monotonically increasing in (that is, the larger the 
gap between the best-site growth rate and the next best, the smaller the benefit of 
migration); and decreasing in the variability of the difference between entries 
However, for this purpose the “best site” is determined by a combination of mean 
difference and variance, and the distance from the best site, in a way that will be 
defined precisely below. 

For the upper bound on growth (that is, the lower bound on the power of e) we 
will need the sub-Gaussian variance factor (defined in section 1.5), which provides 
an upper bound on the tails, hence on the variability of the differences. For the 
lower bound we need the sub-variance of xj:^\ defined in (4). We do not need 

to assume that the sub-variance is positive, but when all sub-variances are 0 the 
power of e in the lower bound is oo, hence the lower bound is trivial. When is 
positive, this implies that the Fenchel-Legendre transform 


(x) := sup Ax — log E 
A>0 



is bounded by 

( 18 ) r’H.) < -A 


In the special case where xj:^^ are normally distributed, the subvariance and the 
lower standard will both be equal to the standard deviation. In other cases the 
“best” sites determining the two bounds may be distinct. 

We write X for the complete collection of all X^ ' for j = 0,1,..., d — 1, 1 < t < 
oo. 

If d = 2 we have an upper bound that a(e) —a(0) is smaller than 
and lower bound A. _ Por d > 2 this becomes slightly more complicated for 
two reasons: First, the growth will be dominated by one dimension that has the 
fastest growth; second, the increment to growth will be smaller if direct transition 
between the best two sites is impossible. For this purpose, for each I < j < d — I 
we define Kj to be the smallest length of a cycle in M that starts and ends at 0, and 
passes through j. (Thus Kj > 2, and is equal to 2 when A{Q,j) > 0 and A{j, 0) > 0 
both with positive probability.) Define also 


(19) 


pU) (i 


U) 

Ttb’ 


:= ^ 


(i) 


Theorem 5. Assume all > q, and conditions (14) and (18) hold. Let j be the 
site that minimises Kjp^J\ and / the site that minimises Kj'p^^ V(1 + ^)- V 

Aq = 0 then for any c' > 2 k j' there are positive constants C, C (depending on the 
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K, p, p*, d, fi-A, ta) such that for all e > 0 sufficiently small, 

(20) < a(e) - a(0) < C'. 


Suppose now Aq < 0. Then 

• If2Kjpi^^ < 1 then both bounds in (20) still hold; 


2k ■/^ (j) 

• If ■^_^ 2 pU') ^ 1 then the upper bound in (20) holds; 


2tv ■ / p^"^ ' 

If ■^_^ 2 pU') ^ ^ then a is differentiable at 0, with a'(0) = — Aq. 


,(/) 


Note that pi^'^ = oo when the distribution of is not heavy-tailed — for exam¬ 
ple, very natural choices such as gamma-distributed diagonal elements — making 
the lower bound on the left-hand side vacuous, but it remains an open question 
whether zero subvariance implies that the approach of a(e) to 0 is faster than poly¬ 
nomial in e. 


2. Excursion decompositions 

Since we are assuming the unique maximum average growth rate is at site 0, the 
maximum growth for the perturbed process will arise from rare excursions away 
from 0; in particular, from those that include the (not necessarily unique) site that 
minimises pn in (19). 

Define £ to be the set — called the excursions from 0 — of cycles in the migration 
graph that start and end at 0, with no intervening returns to 0. For an excursion 
e we write |e| for the length of the cycle minus 2 — that is, the number of time 
steps spent away from 0. 

For a given excursion e we define 

K{e) := {0 < t < |e| -b 1 : e* 7 ^ ej+i} 

K{e) := max^Kj : j € e}; 

p(e) := min Ip*-^^ : j € e| . 

Note that 0 and T are always in K{e), and the definition of implies that 
K{e) > ffK{e). We will refer to K(e) as the diameter of e. 

We write £t for the collection of sequences of excursions that can be fit into 
time {1,... ,T}. That is, an element e € £t has an excursion count k{e), such that 
each i e { 1 ,..., fc(e)} there is a pair {ti, e^) with ti € { 2 ,..., T — 1 } and Si G £ 
satisfying 

ti “b I < , 

tfc(e) “b |^fc(e)| A T. 

We write the total length of an excursion sequence as 

fc(e) 

l|e|| :=El®*l- 

i=l 

We also write £T-,k,n,m for the subset of £t comprising excursion sequences whose 
excursion count is k, whose total length is n, and the sum of whose diameters /t(ei) 
is m. The null excursion sequence is the element of £t with fc(e) = ||e|| = 0. 
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The (0,0) entry of the product Rt will be a sum of terms that are enumerated 
by elements of £t, corresponding to paths through the sites. We define new random 
variables as a function of the realizations of X and of A (the collection of all matrices 


( 21 ) 




log e + log At{j, i) - log - eAo if i ^ j, 
0 if i = j. 


Given an excursion e and a starting time to € {2 ,... ,T — |e|} we define the random 
variables 

(22) e[to;X, A] := ^ at+toi^t,et+i) + ^ - eAe^^ . 

t£K{e) te{l,...,|e|}\Ar(e) 


Of course, this sum may be —oo, if it includes a transition at which the corre¬ 
sponding entry of A is 0. But the assumptions imply that it is finite with nonzero 
probability if e € 5. Given an excursion sequence e = ((tj,ei))^_^ € £ti we define 


(23) e[X,A] :=^e,[t,;X,A]. 
The quantity we are trying to approximate is 

(24) 

Lemma 6. 


i(e) - a(0) = 1 ^logi?T(e)o.o “ • 


(25) log i?T (0,0) = y]Af^+eTAo +log (l+ ^ 

ee£T\{ 60 } 

where e° is the null excursion sequence. 


Proof. We have, by definition, 

i?T(0,0) 


T 

{xo,...,xt) t=l 


where the summation is over (xq, • ■ •, xt) € {0,... ,d — 1}^+^ with xq = xt = 0. 
Note that we may restrict the summation to (T-l-l)-tuples such that {xt, Xt-i) > 
0, which will only be true when {xt-i,xt) is an edge of M. Such sequences of states 
map one-to-one onto excursion sequences. The product corresponding to excursion 
sequence e = ((ti,ei))^_^ is 

fc-l-l ti k |ei| 

(26) n( n A*(0,0)) •n(n^t+t.((«*)*’ 

i=l i —1 t —1 


where to = 0 = T. 

We have Zl((0j0) = e^*' 
(26) as 


-i-eAo^ Thus, we may write the log of the expression in 


k |ei| 


+eAo) -^^log 


t=i 


i=l t=l 


^t+ti (ei)t+i) 

A+q(0,0) 


(27) 
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Noting that 

and for j ^ f, 
log 


log 




D*t+tShj) fU) _ T 


= logeAt{j,f) - logX(^°^ - eAo = at+uif,j)- 


DUtMO) 


Since K{ei) is precisely the set of t such that (e^)* ^ {ei)t+i, this means that (27) 
is precisely the same as X, A], which completes the proof. □ 

Thus 


(28) 

and 


log Rt{0, 0) - eTAo - V xf ^ > max e[X, A], 

i—i bGEt 


log i?T(0,0)-eTAo 
(29) 


<31ogr+ max (log #fT;fc,n,m + max e[X,A] 

l<k,n,m<T \ eGST-.k,^,^ 


Combining this with (24) yields the bounds we will use: 
(30) 

and 


a(e) — a(0) > eAg + liminf T ^ maxe[X, A], 

T^oo 


a{e) — a(0) < eAg 

(31) 


+ lim sup T max 

T—¥(X) l<fc,n,m<T 


(log #£T-k 


+ max e[X, A] j. 

eG£T-,k,-n,m 


3. Distinct growth rates 

3.1. Derivation of the upper bound. We prove the upper bound in (20). We 
may assume that A = 0, since decreasing A can only decrease a(e) — a(0). Similarly, 
we may replace At{i,j) by At{i,j) V 1 for any {i,j) e M. That is, we put a floor 
under those off-diagonal elements which are allowable migrations. This avoids the 
nuisance of having entries be sometimes 0, and again, an upper bound that holds 
under these conditions will hold a fortiori under the original conditions. 

We also begin by assuming that there is a single site — which we may call site 
1 without loss of generality — that minimizes both Kj and pj. Since we are only 
concerned with the behavior of a(e) as e 0, we will, without loss of generality, 
confine our attention to 

(32) 0 < e < . 

An element of £T:k,n,m may be determined by the following choices: 

(i) Choose k points out of T where the excursions begin, yielding no more 
than (^) possibilities; 

(ii) Choose k numbers for the lengths of the excursions that add up to n, 
yielding no more than (”) possibilities; 

(iii) Choose m — 2k timepoints within these excursions as times when there is 
a change of site, yielding at most {^^ 2 k) possibilities; 
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(iv) There are no more than d™ ways to choose the sites to which the excursions 
move at the m times when there is a change. 

A crude bound based on Stirling’s Formula is 


log 



< & + 6 log 


a 

V 


which holds for all positive integers b and 0 < a < 6, as long as we adopt the 
convention 0 • log 0 = 0- log oo = 0. Then 


^ fl fl T 

(33) log #£T-k,n,m < TO log d + (m - 2fc) log + fc log - + T log —. 

Claim 7. For any positive c' > 8ki, and any 

Z > g2«lPl/(l+2pi) . (logg-i)^', 


we have 


(34) limsupT ^logP{ max (e[X, A] + log >-zT} < 0 

T—^OO GGST;k,n,m. 


for all e > 0 sufficiently small. 


We prove this claim in section 3.3, and proceed here under this assumption. This 
means that 


E p 

T-To 



max 

l<k,n,m<T 




max 




< OO. 


By the Borel-Cantelli Lemma, this implies that with probability 1 this event occurs 
only finitely often. It follows that the limsup is smaller than z almost surely, and 
hence, by (31), that 

(35) a(e) - a(0) < 


It remains only to clear away the assumption that that Kj and pj are both 
minimized at site 1. We do this by stratifying the excursions further by their 
diameter (recall the definition from section 2). Define 

p{k) := min|pj : Kj < k}. 

If e is an excursion with diameter k, then any site j included in e has Kj < k, hence 
also Pj > p{k). Furthermore, 

(36) nun 2pjKj/{l + 2pj) = min 2p{k)k/ (1 + 2p{k)) 

l<j<a—1 2<K:<a—1 

The maximum in (31) may be written as a maximum over (fc 2 ,..., kd-i), rep¬ 
resenting the number of excursions whose diameter is 2,3,..., d — 1, with the con¬ 
straint X] = k. We write S^l. „ for the excursion sequences consisting of k 
excursions, all of which have diameter k; and £T-,(k^),n,m for tho set of excursion 
sequences that have exactly excursions with diameter k. Then £T-,(k^),n,m nat¬ 
urally includes the direct sum of £^]^^ „ 
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Using the general fact that the maximum of a sum is smaller than the sum of 
the maxima, 


max e 




X, A] = max 


Thus we have 


max 

l<k,n,m<T 


(log 


Y,k^ = k ee£- 


< max 


max 


max 


J2 k^=k g 


b[X,A] 


< > max max [X, Al. 


^(k) *^^ 7 -'. 


(J) 




max 

G^ST\k,n, 


e[X,A]) <E 


max 


max 


«(k) 


©(k) [X, 


Because all excursions in £^1-^ pass through only sites j with pj > p{k), 

the same argument used for the upper bound in (35) may be applied to show that 
almost surely 

limsupT-i max max e(„)[X, A] < c«,(log 

T-J-oo l<fc,.n,,m„<T e(,) 

It follows that for c := {d — 2) ■ maxc^ and d := maxc),, 
limsupT“^ max (log n m+ max e[X, All 

T-j-oo l<k,n,m<T\ ’ ’ ’ ee£T;fc,™,™ ' 

< c{loge~^y' 2kjPj/(i+2pj) 

by (36), which completes the proof. 


(i) 

3 . 2 . Derivation of the lower bound. We may assume that Kjpl attains its 
minimum at j = 1. We will write k and p» (with no superscript attached) for ki 
and pi^\ Let 0 = jo. Ji, J 2 , = 1, j/+i, • • -Jk-iJk = 0, be a cycle from 0 in 

M, passing through 1. We may fix a real number A* and p > 0 such that 

K— 1 

(37) p|E(logE°^ - logAt+,(j,, j,+i)) < kA*| > p. 

Assume that e < e“^ and r > log e“^. Defining fc = [T/mJ and m = [Kloge“^//il^lj, 
we will apply (30) by considering only excursions of length exactly m— 1, which pro¬ 
ceed exactly through the sequence of sites 0 = jo, ji,..., j/- 1 ,1,..., 1, jz+i,..., j^- 1 , j^ = 
0, where site 1 is repeated exactly m — k + 1 times. The basic idea is that the ex¬ 
cursion fills a time block of length m, proceeding as quickly as possible from 0 to 
1, remaining as long as possible at 1, and then returning to 0. 

We define the standard excursion Bq := (ji, ..., jj-i, 1, ..., 1, j/+i, ..., j^-i), 
with m — K + 1 repetitions of site 1; and an excursion sequence Bq consisting of 
those pairs {im -|- 1, b^) for which 

(38) Yi ■= Bo [£m + 1; X, A] > 0. 

That is, 60 is put together from identical excursions of form Bq which can start only 
at times £m + 1. Each one of the k possible excursions is included precisely when 
its contribution to the sum would be positive. 
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We have 


k-l 


e[X,A] = ^(y,) 


+ ■ 


e=o 


Since the excursion contributions Y( are all independent, combining (30) with the 
Strong Law of Large Numbers yields 


(39) 


a(e)-a(0 )> - eAq. 


m 


We now observe that for any i 

I-l K-l 




X, 


O^em+ijjiT ji+l) + (^(e+l)m-K+iijij ji+l) + E 


(1) 


i=0 


i=I 


Note that the at terms are independent of the terms. 

By (37), for any y > 0, 

"e > y} > pV^Z > y + K(loge“^ + A* - cAq) + {m - k + l)/i(^^|, 

where 

Z := y: (.?;«+y‘>) 

is a mean-zero sub-Gaussian random variable with sub-variance bounded below by 
{m — K+ 

By Cramer’s Theorem [DZ09, Theorem 2.2.3] and the bound (18), 
lim (m — K -I- 1) 

m—¥OQ 


-1 


m-K + 1) ^Z>y + 2y^^^-\ -(loge ^-f A*-|-cAq 


m — K + 1 


>-J(2/i(i) + y) 
p*(2/i(i) +y)^ 


> 


2/i(i 


We then have for any 5 > 0, for hxed y and m sufficiently large (that is, for e 
sufficiently small) 

j-Yf 'I f m(2/i(i) -h y)2p* 

P{->!,|>pexp|--m— 


If 


e < exp < — 


-,(1) 


2^/fIS K 
then by the definition m := [k log , 




< 7, log e 


-1 


it follows, by Lemma 9, that 




_P_ 

mp* 

P 


> -exp |—(2 kp*-I- d) loge 

mpt 

pK 


> 


loge-i' 


2kp,+5 
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Combining this with (23) completes the proof. 


3.3. Proof of Claim 7. Since the probability is decreasing in z, it will suffice to 
show the statement is true for 

(40) z = -(loge-i)"', 


where d is any constant larger than 2ki. 
We define 


C:= 


k 

T’ 



/3:= 


m 

J' 


That is, C is the rate of excursions per unit time; v is the average length of ex¬ 
cursions; and (3 is the average diameter of excursions. We have the constraints 
1 /^ > p > 1 and 1/C > (since ki is the minimum Kj, hence the minimum 

number of changes in each excursion). Then the bound (33) may be written as 


(41) 


< CT iP log d+{P-l)logv- {P- 2) log(^ - 2) - log C) • 


Suppose now we fix some element e of £T\k,n,rm and list all the states of all the 
excursions in order as ji,..., we have 


n 

E [e[X; A]] < m^A + fn log e — ^ 

i=l 

and the random variable Y := e[X; A] — E[e[X; A]] is sub-Gaussian with variance 
factor bounded by 

n 

t{Y) < + TTITA 

i=l 


by Lemma 2. 

For any a;, z > 0, by (8) 


n 

;[X,A] -ba;)) > zT} < p{y > E j^Ui) _|_ rnfiA + ni log e ^ + zT — a:| 


< expi — Z [ + TtlTA 


^ + W log e ^-bzT-x^ |. 


We are assuming that pj is minimized at j = 1, f^^i) < j so that 
log P{(e[X, A]+x))>zT} 

n \ ~ ^ ^ 

/ (i*) _|_ -mT" 1 1 I \ 


1 / 1 V- 

< - - — > -\-mTA 

2 \ Pi ^ 


+ m^A + ^ log e ^ + zT — x 

2=1 


-1 


2 
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Taking x = log^ST-k,n,m and substituting (41), we get 

(42) 

log P{ max (e[X, A] + log #£T-k,n,m)) > zT] 

< log + max log P{ (e[X, A] + log #5T;fe,n,m) ) > 2 ?"} 

< T sup sup sup C P log dS/ /X* — log ^ — (/3 — 2) log(/3 — 2) 

S>At./3 >ki 0<C<1 L 

- {28/+ 2ta(3)-^ (^5 + logC + ^ + (/3 - 2) log(/3 - 2) + /^(ci + log(^e)-i)^ , 

where 

/i* :=min/xl^l, p* :=max/il^l, 

Cl := - log d, 

n 

S' := ^ > vp^ > /X* 

It is important to note that our assumptions ensure that /x* > 0. 

We need to show that this supremum is negative. Since the limit as C ), 0 is —oo, 
it will suffice to show this for all possible values of the quantity in square brackets, 
which we denote by 

(43) 

0 := /3 logdS/^* —\ogz + log XX — (/3 — 2) log(/3 — 2) 

(^S - logxx + xt + log 0 + (/3 - 2) log(/3 - 2) + /3^ci + log(Se)“^^^ 

28/Pi + 2fA/3 

where u = z/C,. Here we have taken advantage of the fact that logC < 0. 

We consider now two different regions for S: 

(i) 8 > log^ e and 

(ii) p* < 8 < log^ e. 

We may assume, without loss of generality, that e is small enough that the bounds 
will be in the right order; that is, log^ e > /x». 

Case (i) We define /3 := (/3 — 2)/8 > 0, and rewrite 0 as 

0 = (5/3 + 2) log d8/ /X* — logz + log u — 8 P log 5/3 

(^5 - logXX + XX + log 2 + 5/3log5/3 + (5/3 + 2){ci + log(5e)“^^^ 

28/Pi + 2f_A5/3 + Ata 

Again, since we are only interested in showing that this expression must be negative 
and bounded away from zero, we may equally well analyze 0/5. We have then 
(_44) 

® = C2/3 - /3 log/3 + 5“^ (2c2 - logz + logxx) 

+ 5“^(2ci + XX — logxx + loge“^) + 5“^ log 2 : + /3(ci + log/3e“^) + 25“^ log 5^ 

2 /Pi + 2fA/3 + 4 ta/5 
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where C 2 := log(i//x*. 

We may find Cq such that for all 0 < e < eg, 

(45) loge < —2|ci| — 2, which implies /3(ci +log/3e“^) > —ee 

(46) loge < -y/2fAPi; 

2|c 2| + log(4/pi + 2 + ta) _1^ 

log^e (2pi + l)loge-i 4’ 


-Ci-1 


> -e 


-3. 


(48) 


(1 + 2c3)2 


< 2 c 3 , where C 3 := (4/pi + 2 t^)' 


C3(ci + loge-i) - 2 c2 

(49) C 3 (ci + loge“^) - 2 |c 2 | > 1. 

We consider the derivative of ©/S' with respect to u, which is 


1 

Su 


1 - 


1 +/3(ci + log/3e ^) + S ^(2ci + u — logM + loge ^) + S ^logz + 2S ^logS 


1/pi + taP + ^ta/S 

Because the numerator and denominator of the large fraction are both positive, 
0/S must be increasing for u G (0,1). For u > 1, using the bound (45), the 
derivative is bounded above by 


(u-l) 


1 

Su 


1-e 3 +/3(ci + log/3e ^) + S Mogz 
1-.- [u - 1) 


2/pi + taP 

which will be negative for u > p\ + 2 + fA- Thus 

+/3(ci +loge-i) +/31og/3) 


® < C2/3-^log/3- 


4/pi + 2 t^/3 


by (47). 

If /3 > 1 then 


® <-C 3 (ci+loge ^)^ + |c 2 |<-l 


by (49). 

If /3 G [0,1], we may bound this by 


® < C 2/3 - /31og/3- C 3 Q +/3(ci +loge ^) +/31og/3 


where C 3 is defined in (47). Given the bounds /31og/3 > —y P and $ < y/3 for 
/3 G [0,1], and given that ci + loge“^ > I, we obtain the bound 


(^1 +/3(ci + loge ^) + /3log/?) >i- 2 i/^ 


>-—2^/3 + 2 (ci + loge ^)P, 


and so 
0 
S 


< (^C2-2c3(ci+loge '^)^P+(l+2cz)\fp 
by (48). 




(1 + 2c3)2 


4 803(01 + log e“i) - 4 c 2 4 


-^<0 


Case (ii) In the remaining cases we simplify the dependence on /? by using 
(/? — 2 ) log(/? — 2 ) > — e“^ > —/? log 2 for /? > 2 , 
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to obtain 
(50) ^ 

0 < 0 := /3 log — logz + logu 

- {2S/pi + 2t^/ 3)“^ ^5” - logu + m + logz + /3(^ci + log(25'e)“^^ 


We assume e < eo, chosen such that for all such e, 


Ki 


(51) ^ loge ^ + Ni (ci - log2^«e) > 0; 

1 + 2pi 

(52) < 


Pi 


--1 -2pi+2’ 

(53) loge“^ > f^pi - Cl + log41oge“^; 

— 1 ^ Cg 


loge- 
-1 


(54) log log ( 


> 


(2pi + 1)(c'-2ni) 


(55) where cg := C 5 — 2pift)i(ci — log2) — 2pjf^Ki, 05 :=-h Ni log2(l/^*; 


Pi 


(56) loge ^ > max |8 + 2ci, 804(02 — logC 4 )} , 

(57) where 04 := 64 f— + ^ • 

VPi P* 


We have 


de _ 1 

du u 

_ 1 
u 


( / \ S'-logM + 'u + log 2 : + /3(ci-log25'-loge)\ 

-5T1w^ -j 

f 1 - Pi 


S'— logw + u + log 2 :e +/3(ci — log2S — (2pi + 1) ^loge) 

S + pitaP 


We know that u — log u > 0, and 

log 2 ;g- 2 Kipi/( 2 pi+i) ^ Q _ Jpg 2 S — (1 + 2 pi)“^ log e > Pita 


by the definition of z (40) and the upper bound on S in this case, respectively, 
which together imply that 

S — logM + U + + /3(ci — log2S — (2pi + 1)“^ loge) ^ ^ 

S + PitaP ~ 


It follows that 0 is increasing in u on (0,1) and decreasing in m on (1 + 1/pi, 00 ). 
Consequently, 

(58) ^ 

0 < 0 := plog2dS/p»—\ogz+— —(2S/pi + 2f_A/3)~^ 

Pi 



where we use the assumption (51), which implies that S+log z+P (ci — log 2Se) > 0. 

The next step is to optimize with respect to p. We replace P hy x = P — ki 
(which now takes values on [0, 00 ). Then in the notation of Lemma 8 we set 


A = log 


2dS 

p* 


= —S + 2f^Ki, 
Pi 


B— \oge ^+ci—log2S, C = S+ \ogz + KiB, D 


E = 2fA- 
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Observe that these are all positive, and by (53) 


C > S' + 


Ki 


2 + 4pi 


loge \ 


B > - loge ^ 


We then have 


BD-CE= —SB -2-fAS- 2 ta logz 
Pi 
2 


= —S [log e ^ + Cl - log 25* - f^pi] + - 
p^ 1 


AD < 

< S 


Pi 

> 0 by (53); 
2S 


A taPiKi 

+ 2pi 


loge 


Pi 

loge“^ 
2 

< BC. 


2d 


+ 2 f^Ki ) ( log—+ 21 ogloge ^ 


Thus (62) holds, and we may conclude that 0 is decreasing in x on (0, oo). In 
particular, the maximum over the allowable range [ki, oo) is attained at j3 = ki. 

We now know that the maximum value of 0 is bounded by 
(59) 


sup 0 < sup Ki log S — 

s 


(-S' + log 


Z + KiCi — Ki 


log2S'ey 


< 2 ^^i log loge ~ 


2S/„ + 2f,». -log- + « 

(5 + Cl + iq^ loge“- - Ki log-loge--j 


28/Pi + 2 taki 

where C 5 is dehned in (55). 

By Lemma 8 (applied now with S in the role of x, and ^4 = 0), 
sup 0 < ( 2 ki - c')logloge“^ + ^^^i^loge”^ + C 5 


- log z + C 5 


1 + 2pi 

- 2 pi ( Ki(ci - log 2 ) + 


Ki 


log e — (2tii — d) log log e 


-1 


1 + 2pi 

= {2pi + 1)(2 ki - c') log log e“^ + Cg. 

where Cg is dehned in (55). By (54) this is negative, completing this Case. 


We conclude by hlling in the details that connect 0 to the quantity we are trying 
to bound. Putting both cases together, we see that there is a constant (expressible 
in terms only of c', pi, ki,/«», d, positive eg and r > 0 such that for any hxed 
e € ( 0 ,eo), and every 

2 : > g2Kipi/(l+2pi) 1^8 ki + 1 

the supremum of 0 is less than —r. 

Holding e G (0, eg) hxed, write 

D27-n rj. QiS,l3,u) 

R(U) := —T sup sup sup-. 

/3>ki u>U ^ 


2plfAKi 
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By the above argument, we know that R{U) > 0 for any U > z. According to (42), 
we have 

lim r“^logP| max (e[X, A] + log n m)) > ^7") < sup supsup-©(S',/3, u) 

ee£T;fc.„.™ ■ - ’ / S p u u 

= —zR{z), 

(since ( = z/u) so we need to show that R{z) > 0. For any U' > U > z, 

S([/)>min{^,S([/')}, and R{U')>^R{U). 

Thus, we will be done if we can show that R{U) > 0 for any U. 

Take U = C 4 max{ 2 c 2 , —log 2 , 04 }, where C 4 is defined in (57), and also large 
enough that . Then for any u>U, and any l3 > 0 and any S > /x*, by (44) 

0 < S (c 2 P - /31og/3^ + 2 c 2 - logz + logM - 2 S“^C 4 ^ (^S(l + 2/3 log e“^) + , 

using the facts that 

u — logu + logz > m/4, 

/3(ci + log/3e“^) > —0.02 + ^ loge“^ by the first part of (57), and 
l + 2S-MogS>0.27. 

Note that the bound (57) on eg and the definition of U imply that for u > 17 
and e < eg 

C 2/3 - /3 log /3 - C4 (^1 + 8/3 log <0, 
u~^ ( 2 c 2 - logz + logu) < 4 c4 
We then have for all u > [/, 

- < -C 4 ^ f- + 2'^) < -2^204 1 < 0, 

u \u b) 

so that R{U) > 0, completing the proof. 


Lemma 8. For real A,B,C,D,E, with D,E > 0 and > AE, the expression 
Ax — {Bx + j{Ex + D) is eoncave over x > —E/D and achieves its maximum 
over X in this range — so, including, in particular, all positive x — at 


(60) 


Xq 


\CE-BD\ - BD 
BE 


where B := y/B'^ — AE. The maximum value attained on this interval is 
(61) 

{Bx + C f _ 1 j-2{B + B){CE - BD) - ADE ifCE>BD, 

®>o ^ ~ Ex + D ~ fa' \ 2{B-B) {BD - CE) - ADE if CE < BD. 


If 

(62) 


A C B 


B ^ D^ E 


then xo < 0, and the function is decreasing in x for all x > 0.3 
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Proof. Setting y = Ex + D, we may rewrite the function as 


A 


AD 


f(.y) = -bv- -FT -y [-Fy 


E 

22 


B 


E- 




f BD 


E^ 


V E 


C] y-^- 


2BC 2B^D 


E 


E^ 


AD 

If 


This is a concave function that goes to —oo at t/ = 0. If < 0 then it is 
asymptotic to a line with positive or zero slope as y —)■ oo, so it is increasing for all 
y > 0 (which includes all x > 0). If > 0 then it goes to —oo as y —>■ oo. The 
first two derivatives are 


f'iy) 




-2 


ffy) = -2 




V E 


c 



showing that the second derivative is negative for y > 0, hence the function is 
concave. The first derivative is zero at yo := \CE — BD\/B, which is equivalent to 
(60). The maximum value is 


/(yo) 


2B 

If 


\CE-BD\ 


2B 

If 


{CE - BD) 


AD 

ff' 


which simplifies directly to (61) 

Suppose now that (62) holds. Since B > B — CE — BD < 0 implies 


Xo 


BD-CE- BD AD - BC 

-^^- < 0 . 

BE BB 


□ 


Lemma 9. Let Z be a random variable with tail bound 

V{Z> z]> ^ > 0. 


Then 


Proof. 


2ezo 


^OO 

E[Z+] = / F{Z > z}dz 
Jo 

pOC) 

/ ^-z{2zo+z)/tz 

Jo 

> 0 * ^-z,{2zo+z.)/tz 


for any fixed z* > 0, since the integrand is decreasing. Taking z* = tzI2zq gives 
us the result. □ 
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4. Equal rates 

The equal rates case is, on the one hand, easier, because the average “cost” of 
an excursion is zero, hence there is no need to keep track of the time spent on 
an excursion. On the other hand, this also makes the problem more challenging, 
because there is a much larger class of paths that need to be considered. When 
the average rates are unequal, only paths that spend nearly all their time at 0 can 
even plausibly contribute to the upper bound. With equal rates, all paths — or, at 
least, all paths that spend nearly all of their time at the sites with maximal rates 
— are essentially on an equal footing. This makes the combinatorial bounds more 
challenging. 

We will assume throughout this section that A = 0. The upper bound clearly 
can only be improved by making some elements of A positive. The lower bound 
can only be reduced by e max A^, which will be much smaller than any multiple of 
(loge“^)“^ for e sufficiently small. 

4.1. Trajectories. Instead of considering excursions, which emphasize time spent 
at 0 as the baseline, we will base our analysis on trajectories, which will simply 
be paths in the migration graph M. The set of all trajectories of length T will be 
denoted Tt- The set of changepoints of a trajectory / will be denoted 

K{f) := {t: ft ^ ft+i}- 

We write Tr.fc for the set of trajectories with exactly k changepoints, and we have 
(fc) < #3^T,k < endow with the L'^ norm || • || 2 , defined to be the 

square root of the Hamming distance (the number of times at which the trajectories 
are not equal). The null trajectory f^^'> will denote the path that stays at 0 for all 
T steps. 

We then have the random variables 

Zy:=/[X,A]:= ^ logMft+uft)+ Y. 

t&K(f) ie{0....,T-i}\A(/) 

(We will use the Zf notation for brevity when there is no need to emphasize the 
dependence on X and A.) 

We have the analogue of Lemma 6: 

Lemma 10. 

(63) logi?T(0,0) +log(l+ Y 

i=l fe3^^\{f(0)y 

where is the null trajectory. Thus 

(64) 

liminf max /[X, A] — K{f) loge“^ < a(e) — a(0) 

T->oo /CTt 

< lim sup flog + max /[X, A] — K{f) loge“^ 

T-)-oo /6 Tt 

4.2. Proof of the lower bound. We begin by assuming bounded above by a 
real number R. We may find paths 0 = jo, ji,..., jj = 1 and 1 = Jq, jj, • ■ • ,jj> = 0 
in M, with I + I' = ki. 
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Now consider the random variables 

S2—1' i—i 

Z(ti,t 2 ) := max| ^ ^ log j*) - 

t—si+I i—0 

1 ' 

+ logA2-/'+i(j;'+i, jO - N°b'+i: 

tl < Si < S2 < t2, S2 > Si + Hi — l| 
Note that for any fixed £ > 1, the collection of random variables 
{Z(£j,£(j + 1)-1) : j = 0,l,...,fc-l} 

are i.i.d., and 

k-l 

(65) max{Z/ : / e > 

j=o 

The distribution of Z{ti,t 2 ) depends only on t 2 — ti. This allows us to define 

( 66 ) re:=r^/^E[Z{£i,ei+e-l)]. 

It follows from (65) and the Strong Law of Large Numbers that for fixed £, 

(67) lim k~^ maxjZy : / e Jik,Kik} > • O 

k—>-oo 

for any £. We now fix £ = \2ki loge“^/r*]^, where L* is defined in Lemma 11. 
For any T > £ we then have 

T“^max{Z/- iL(/)loge“^ : / G Tt} > max{Z/- iL(/) log : fGJki} 

>T~^m&x{Zf : / G Tfc£,Kifc} - loge“\ 

since K{f) = mk for any / G 
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Proof. We consider, for definiteness, Z{0,£ — 1). We suppose first that is 
almost-surely bounded by some number R. 

By the definition of M and the fact that (At) are i.i.d.. 


^-1 

A* := essinfE|^^logAj(jj+i,ji) | ® +essinf logyl,(j'+i, j') | ® 

i=o ■ ~ 


I'-i 


i=0 


> —oo. 


That is, the expected value of either sum is at least conditioned on any real¬ 
ization of 

Define for 0 < t < 1 


Wt := 


V£a 




(1) 


Then 


Z(0,£- 1) = max I V£a(Wt> - Wt) -h V log Ap^j+i+i(ji+i, ji) - xi 

|<f<t'<i-|V Ti;) 

r 

7-1 r 

+ E^°sA+*(7+i.Ji) + E^°§A'--f'+**^7'.Ji-i)) 


do) 

lt7J+l+z 


2=0 


2=1 

7-1 7' 


2=0 


2 = 1 


where we define random variables ? = \ t't\ — 1 and i = [t£\ + 1, where t' and t are 
the locations at which Wt> — Wt is a maximum. We then have 


(69) 

E[Z(0,£- 1)] > E 


> E 


max fv7cr(Wt' — Wt)) 

i<t<t'<i-|V / 

■i-i 

'^\ogAt+^{j^+l,ji) 

■i=0 

max (V£a{Wt' — Wt)) 


■ ess inf E 


— k\R 




ess inf E 


I'-1 


^ logAt'+i(j*+i,i*) 


*- 2=0 


D 




“1“ A:^ - K\R. 


Note that the essential infimum is required in the second line because t and t' 
depend on D. 

By Donsker’s invariance principle {cf. Theorem 2.4.4 of [EKM97]) (Wt)o<t<i 
converges weakly (in supremum) to a Brownian motion (wt)o<i<ij so that 



max (a{Wti — Wt)) 

£—^oo 

■ 

E 

- > (tE 

max ujt> — CO t 


|<t<t'<i-|V / 




-I'+i 
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The maximum on the right-hand side is sometimes referred to as a “descent vari¬ 
able” of the Brownian motion.^ By Theorem 2 of [DSYOO] it follows that 
(70) 


lim E 
^—>•00 


max ( (jfWt' — Wt )) > crEf max ( max ujti — w#)] = \/8/7rlog 2 • a. 


Suppose now we do not necessarily have a bound on For any R > 1, 

2r(°) ^__r2 /2 ^(o) 

r(0) 


E[(Y:f^ - i?)l 

E^(^W2 _ ^2^^ 


{XJ“>>R}] - R ^ 


We now create new random variables Z^{0,£ — 1) by replacing every appearance 
of in the definition of Z{0,£ — 1) by 


(0) 


Xt,R :=RA Xr + , 

where R may now depend on £. (This changes xj;^^ as well as xj^K) We still have 
E[Y(^^] = 0, and Thus the variance cr|j of the new differs from cr^ by no more 
than 


E[(YfV] - nKR] = |E[(^ro" - Kr] 


(0)n2 


< E 




(0) 


■2RE 


{Xt R)l^^(o)^^^ 


■E 


(Yf) - 


< (6r(0)+4(T(°))2)e-«'/2x 

We see that 


e-i 


(0) 


>R} 


\Z{0,£-1)- Z^{Q,£-l)\ < -bE[(xf^ 

t=o 

so that 

|E IZ(0,£- 1)]-E [Z^(0,£- 1)] I < E [\Z{0,£- 1) - Z^{Q,£- 1)|] < 
Take R = log^. Applying (69) to Z^{(),£ — 1), we may conclude that 
liminf — 1)1 > lim inf a r^J S / t: \og2 — lim sup —— < 

I!—>oo ^ ^ >oo >00 R 

= (T\/8/7rlog2. 


-R^/2Z°'> 


□ 


4.3. Proof of the upper bound. We replace by max^/j/ At{i',j') V 1 for 

all i ^ j. This can only increase the value of a(e), so it sufhces to prove the upper 
bound under this new condition. Similarly, the upper bound will only be increased 
if we add to each X^^\ so it will suffice to prove the upper bound under the 
assumption that all are 0. We also write 

, . 

Co := 1 5maxf^^^ 


^In [DSYOO] it is referred to as the “downfall”, but this is an awkward translation from the 
Russian. In mathematical finance it is referred to as “maximum drawdown” [MIAPAM04]. 



GROWTH RATES WITH RARE MIGRATION 


25 


Note that each is sub-Gaussian, and for any /, /' e J't 

n—l 

Zf - Zf, = - {K{f) - K{f )) loge-i. 

i=0 

The first term is a sum of independent sub-Gaussian random variables with variance 
factor no bigger than 2maxj and the second term is deterministic, so that by 
Lemma 2 

\\Zf-Zf,y<dif,n :=Co\\f-fh. 

We now fix an increasing sequence of integers 1 = mo < mi < • • • < mj < 
mj+i = T, to be determined later, where we assume that mj = [T/2\. We define 
for J > j >0, 

(71) Zi := max{Z/ : / € |J 

mj<k<mj^i 

We may then use (63) to obtain 

(72) 

a(e)—a(0) < limsupT”^ logfu-y^ (m,+i— to,) [ ^ | e^^ +Te'^-^( ^ 

T^oo V ^ \mj+ij \mjJ ) 

Note that, in order to obtain an upper bound, we need to take a smaller power of 
e than any in the range [to^ , to^+i) that are combined into one term. 

To bound the Orlicz norm of Zi we use chaining, as described in [Pol90]. By 
Lemma 3.4 of [Pol90] we know that for any T* C 

(73) II maxZ/ll^ < ^ ^^^^2 + logi^(CoV^/2% T*), 

i—1 

where the packing number Dir, T*) is the maximum number of points that may be 
selected from T*, with no two of them having || • ||2 distance smaller than rjCo- 
(In principle there would be an additional term for the norm of Z^io), but that is 
identically 0.) 

The packing numbers for Tr.fe are difficult to estimate precisely, particularly for 
large k, but fortunately we can make do with fairly crude bounds, such as we state 
below as Lemma 12. Substituting (77) into (73), and using the fact that the bound 
is increasing in k, we see that for j < J — 1, 


D ( CoVf/T , U Tr.fe J < TTO.+id^^+i+i: 

\ mj<A;<mo + i / 


Te 


Te 


TOj + i ’ T/4* - TOj + i 


“1-2 6 


and 


dIcoVt/T, U Tr.fc j <T"(8edlogd)^ 

\ mj<k<T / 


mj<k^l 

Consider some fixed j < J— 1. If we let i* = [log 4 r/2TOj+iJ, then for i < i* — 1, 
and 2** > ^y^r/TOj+i > ^iJ\ogiTde/mj+i). 


ruj + i 
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So for j < J — 1 

\/T ! - 

wmu < T. logSm^+iTd + mj+i logTde/(4 — mj+i) 

i^l 

^ \/t I - 

+ X! ° 2 » ^J\ogSmj+iTd + mj+i log(rde/wj+i) 

i—i* 

\/t j - 

- X! ° 2 i ylog(8mj-+irrf) + irrij+i log4 - m^+i log(l - 4*TOj+i/r) 

+ 2I-1 V^og(^^i+i^^) +TOj+ilog(T(ie/mj+i) 

“ CiVt n- Civ^ /- , , -- 

< C2\/Tmj+i 

for some constants (71,(72- By choosing C 2 appropriately we may ensure that this 
bound holds as well for j = J. 

By definition of the Orlicz norm, stated as (2), this means that for 1 < j < J, 
E[exp{(Z^)VC2^Tm,+i}] <5. 

Applying Markov’s inequality we have 

¥{Zi > z^/Tn^,} < . 

Let z = max{l,(72}, and for any T > log^e“^ define to be the event on 
which Zi < ZyjTmj+i{j + 1) for all j. Note that 

00 

(74) P(^z.t) < -l) . 

j=i 


This bound is smaller than 1, from which it follows that P(A 2 _ 7 ’) > 0. 

We now take rrij := [4Tjz^/ log^ ej as long as this is < T/2, then set nij = [T/2J 
and mj+i = T. We note that J + 1 < (log^e)/4z^. By the constraint on z, we 
have for J — 1 > j > 1 


Te ^ elog^e/4 ^ log^ e 
"^3 ~ jz2-log^e/4r “ jz^ ’ 


so 



< 



4TizV log" e 


Similarly, we have on the bound 


Zi < zJTmj+i{j + 1) < 2z^(j + l)r/loge ^ 
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Substituting into (63) we see that on the event 
(75) 


a(e) -a(0) < Iminf r-Mog(l + 

i=i 


log e 


,2 \4Tjz^/log^e 


p2z^(i + l)T/ loge- 


+ T2(4g)T/2eTzv^7+Ty 


The summand may be bounded by 

r 2Tz^ 

while the additional term on the second line is bounded by 

T2(16e)^/'‘ < 1. 


Thus 

(76) 

a(e) - a(0) < liminfT-i ^log T + log(2T-i + (l - e^)~^ expj + 1 O 0 )|) 

_ 4^2 + 102 
loge“i 

on the event Az^t- Since the event has positive probability, and since a(e) — a(0) is 
almost surely constant, the bound holds with probability 1. 


Lemma 12. For any r and positive integers T, k, with \/T > r > k > 0, 

( 77 ) 

Proof. Let r' = (r/Co)'^. Suppose that r' > k, let j = [r'/k\, and let m = 
\T/j~\. Let T* be the set of ordered (non-decreasing) sequences of length k from 
{0,..., m — 1}, crossed with {0,..., d— and define a map {(j), ip) : 3^n,k —t 9^* 

by letting {f}i be the i-th coordinate where / changes, and defining 

■nf). = [IfiJ 

and ip{f)i = that is, the site that / moves to at its f-th change. 

If / and /' are two elements of with = 4){f') and ip{f) = ipif), 
then fi = f[ as long as \i/j\ ^ <('(/), since any t ^ ())(/) corresponds to a span 
of tj,tj + 1, ■ • ■, + j — 1 where ftj = f^j (because they started with /o = /g, 

and the number of changes in fo,..., ftj-i is the same as the number of changes 
in /(),•••, fii-i)- Thus d(/, /') < kj < r', meaning that ||/ - /'II 2 < CgV^ = r. 
By the pigeonhole principle, any subset of 'Jn,k of size greater than has points 
with |j • ||-separation no more than r. Hence 

D{r,Jn,k)<#^*= 

Combining this with the trivial bound D{r, ‘Jn,k) < ffAn.k = (^) and the bound 
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completes the proof. 


□ 


5. Simulations 

We illustrate the results with some simulation tests. We want to show that in the 
case where two sites have identical mean expected log growth (or in the diapause 
case) the changes in a are approximately like 1/loge“^ for e close to 0; and that in 
the case where the maximum expected log growth rate occurs at only a single site 
the changes are like a power of e, and that the power is approximately as predicted. 

5.1. Diapause case. We begin with a very simple 2x2 example: 

Mt = , At = , with St ~ Unif(0.05,0.99), Bt ~ Gam(5,2), 

with St and Bt independent. We have a(0) = i(E[logS't] +]E[logi?t]) = —0.0193. 


e 

1/ log e ^ 

a(e) 

0.500 

1.443 

0.305 

0.400 

1.091 

0.256 

0.300 

0.831 

0.206 

0.200 

0.621 

0.153 

0.100 

0.434 

0.097 

0.050 

0.334 

0.065 

0.010 

0.217 

0.028 

0.005 

0.189 

0.022 

0.001 

0.145 

0.012 

10-4 

0.109 

0.003 

10-5 

0.087 

-0.001 

10-6 

0.072 

-0.005 

0 

0.000 

-0.019 


Table 1. Simulated diapause example 


The results are tabulated in Table 1, for values of e down to 10“®. In Figure 1 
we plot a(e) against l/loge“^, and see that for small values of e the values are very 
close to a line (with slope approximately 0.2. 

5.2. Migration case. We now consider a 3 x 3 example: 



0 

0 ^ 

/O 

c 

1\ 

0 


0 

At{C) = 1 

0 

c] 

0 

0 


Vc 

1 

V 


with zl.^\ zj:^\ i.i.d. standard normal random variables, and (7 is a nonnegative 
constant. If C = 0 then the migration graph is a cycle of length 3, so ki = K 2 = 3; 
if C > 0 then = k 2 = 2. 

We consider four different cases for (^, cr^, C): I : (—0.1,0.5,1), II : (—0.1, 0.5,0), 
III : (-0.1,1,!), and IV : (0,0.5,1). In all cases a(0) = 0. The results of this 
paper predict that case IV should be like the diapause example in section 5.1, with 
a(e) behaving like c/loge“^ for some constant c, when e is small. 
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Figure 1. Simulated diapause example. 


The first three cases should have loga(e)/loge ^ converging to a constant as 
e I 0. We have = 0.1 in all three cases. For cases I and II we have = 

0.1, so that the power for case I is between 


2 • 2 • 0.1 = 0.4 and 


2 - 2 - 0.1 _ 1 
1 + 2-0.1 “ 3’ 


and for case II is between 

2 • 3 • 0 I 

2 • 3 • O.I = 0.6 and \ -\-2 —OT ~ ^ 


For case III is decreased to 0.05, so the power is between 


2 • 2 • 0.05 = 0.2 and 


2-2-0.05 _ 2 
1 + 2 • 0.05 “ IT' 


We plot some simulated results in Figures 2 through 4, plotting the loga(e) 
against loge”’^. In the limit as e —?• 0 this should approach a line whose slope is in 
the range given for the power of e in Theorem 5. We plot lines with those slopes 
in each figure, and see that in the lowest range of e (we take it down to e = 10“®) 
the slope comes down close to the upper limit, but is still higher. Of course, this is 
completely consistent with the true exponent being at the upper limit, particularly 
since we don’t know anything yet about how small e would need to be before the 
asymptotic slope becomes apparent. 
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Figure 2. Simulated migration example with path length 2. The 
red lines have slope 0.4 and 1/3. 
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Figure 4. Simulated migration example with path length 2, and 
cr^ = 1. The red lines have slope 0.2 and 2/11. 



